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The origins of Ashkenazi Jews remain highly controversial. Like Judaism, mitochondrial DNA 
is passed along the maternal line. Its variation in the Ashkenazim is highly 
distinctive, with four major and numerous minor founders. However, due to their rarity in the 
general population, these founders have been difficult to trace to a source. Here we show that 
all four major founders, ~40% of Ashkenazi mtDNA variation, have ancestry in prehistoric 
Europe, rather than the Near East or Caucasus. Furthermore, most of the remaining minor 
founders share a similar deep European ancestry. Thus the great majority of Ashkenazi 
maternal lineages were not brought from the Levant, as commonly supposed, nor recruited in 
the Caucasus, as sometimes suggested, but assimilated within Europe. These results point to 
a significant role for the conversion of women in the formation of Ashkenazi communities, 
and provide the foundation for a detailed reconstruction of Ashkenazi genealogical history. 
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living Jews—remain highly contested and enigmatic to 

this day!~!!. The Ashkenazim are Jews with a recent 
ancestry in central and Eastern Europe, in contrast to Sephardim 
(with an ancestry in Iberia, followed by exile after 1492), 
Mizrahim (who have always resided in the Near East) and 
North African Jews (comprising both Sephardim and Mizrahim). 
There is consensus that all Jewish Diaspora groups, including 
the Ashkenazim, trace their ancestry, at least in part, to the 
Levant, ~2,000-3,000 years ago!?-!4. There were Diaspora 
communities throughout Mediterranean Europe and the Near 
East for several centuries prior to the destruction of the Second 
Temple in Jerusalem in 70 CE (Common Era), and some scholars 
suggest that their scale implies proselytism and wide-scale 
conversion, although this view is very controversial”!. 

The Ashkenazim are thought to have emerged from dispersals 
north into the Rhineland of Mediterranean Jews in the early 
Middle Ages, although there is little evidence before the twelfth 
century*!>. After expulsions from Western Europe between the 
thirteenth and fifteenth centuries, the communities are thought to 
have expanded eastwards, especially in Poland, Lithuania and 
then Russia. The implied scale of this expansion has led some to 
argue, again very controversially, for mass conversions in the 
Khazar kingdom, in the North Caucasus region to the north and 
east of the Black Sea, following the Khazar leadership’s adoption 
of Judaism between the ninth and tenth centuries CE8”. 

We are then faced with several competing models for 
Ashkenazi origins: a Levantine ancestry; a Mediterranean/west 
European ancestry; a North Caucasian ancestry; or, of course, a 
blend of these. This seems an ideal problem to tackle with genetic 
analysis, but after decades of intensive study a definitive answer 
remains elusive. Although we might imagine that such an 
apparently straightforward admixture question might be readily 
addressed using genome-wide autosomal markers, recent studies 
have proposed contradictory conclusions. Several suggest a 
primarily Levantine ancestry with south/west European admix- 
ture*4, but another concludes that the ancestry is largely 
Caucasian!®, implying a major source from converts in the 
Khazar kingdom’. An important reason for disagreement is that 
the Ashkenazim have undergone severe founder effects during 
their history, drastically altering the frequencies of genetic 
markers and distorting the relationship with their ancestral 
populations. 

This problem can be resolved by reconstructing the relation- 
ships genealogically, rather than relying on allele frequencies, 
using the non-recombining marker systems: the paternally 
inherited male-specific part of the Y chromosome (MSY) and 
the maternally inherited mitochondrial DNA (mtDNA). This 
kind of analysis can be very powerful, because nesting of 
particular lineages within clusters from a particular geographical 
region allows us to pinpoint the source for those lineages, by 
applying the parsimony principle. This has indeed been 
attempted, with the MSY results interpreted plausibly to suggest 
an overwhelming majority of Near Eastern ancestry on the 
Ashkenazi male line of descent!!!8-!, albeit with much higher 
levels (> 50%) of European (potentially east European) lineages 
in Ashkenazi Levites**, suggesting a possible Khazar source in 
that particular case. 

The maternal line has also been studied, and indeed Ashkenazi 
mtDNAs are highly distinctive, but they have proved difficult to 
assign to a source population! *!!, Some progress has been made 
by targeting whole-mtDNA genomes or mitogenomes, which 
provide much higher genealogical (and therefore geographical) 
and chronological resolution than the control-region sequences 
used previously—although the far larger control-region database 
remains an invaluable guide to their geographic distribution. 


iE: origins of Ashkenazi Jews—the great majority of 


Using this approach, Behar et al.? identified four major founder 
clusters, three within haplogroup K—amounting to 32% of 
sampled Ashkenazi lineages—and one within haplogroup N1b, 
amounting to another 9%. These lineages are extremely 
infrequent across the Near East and Europe, making the 
identification of potential source populations very challenging. 
Nevertheless, they concluded that all four most likely arose in the 
Near East and were markers of a migration to Europe of people 
ancestral to the Ashkenazim only ~2,000 years agot. The 
remaining ~60% of mtDNA lineages in the Ashkenazim 
remained unassigned to any source, with the exception of the 
minor haplogroup U5 and V lineages (~6% in total), which 
implied European ancestry), 

Here we focus on both major and minor founders, with a much 
larger database from potential source populations. We first 
analyse 956 (72 newly generated) mitogenomes from haplogroup 
U8 (including 909 from haplogroup K, U8’s major subclade): 477 
of these are from Europe and 106 from the Near East/Caucasus. 
We show that European and Near Eastern lineages largely fall 
into discrete, ancient clusters, with minor episodes of gene flow, 
suggesting that haplogroup K diversified separately in Europe 
and the Near East during the last glacial period. Of the three 
Ashkenazi founders, Klalbla and Kla9 were most likely 
assimilated in west (perhaps Mediterranean) Europe and 
K2a2al in west/central Europe. Most surprisingly, by analysing 
two new N1b2 sequences selected from a database of 278 N1b 
HVS-I sequences, in the context of 44 published N1b sequences”*, 
we show that the highly distinctive N1b2 subclade, making up 
another 9% of Ashkenazi lineages, was likely assimilated in 
Mediterranean Europe, rather than in the Near East as previously 
proposed’. Moreover, from a survey of another > 2,500 complete 
mtDNA genomes and > 28,000 control-region sequences from 
Europe, the Near East and the Caucasus, in comparison with the 
available database of 836 Ashkenazi control-region sequences and 
a handful of published mitogenomes, we also evaluate the minor 
founders. Overall, we estimate that most (>80%) Ashkenazi 
mtDNAs were assimilated within Europe. Few derive from a Near 
Eastern source, and despite the recent revival of the ‘Khazar 
hypothesis’!®, virtually none are likely to have ancestry in the 
North Caucasus. Therefore, whereas on the male side there may 
have been a significant Near Eastern (and possibly east European/ 
Caucasian) component in Ashkenazi ancestry, the maternal 
lineages mainly trace back to prehistoric Western Europe. These 
results emphasize the importance of recruitment of local women 
and conversion in the formation of Ashkenazi communities, and 
represent a significant step in the detailed reconstruction of 
Ashkenazi genealogical history. 


Results 

Four major founder lineages within haplogroup K and N1b. 
Haplogroup K arose within haplogroup U8 ~ 36 ka, in Europe or 
the Near East, with the minor subclades K1b, K1c and K2 all most 
likely arising in Europe, between the last glacial period and the 
Neolithic (Fig. 1; Supplementary Note 1; Supplementary Data 1-3; 
Supplementary Figs $1-S3; Supplementary Tables S1-S3). Kla 
expanded from ~20ka onwards, both in the Near East and 
Europe, with its major subclade, Klalb1 (Fig. 2), mainly restricted 
to Europe (with a few instances in North Africa), arriving from 
the Near East by ~11.5ka, the beginning of the Holocene 
(Supplementary Note 1). 

Almost half of mtDNAs in west/central European Ashkenazi 
Jews belong to haplogroup K, declining to ~15% in east 
European Jews!!!, with almost all falling into three subclades: 
Klalbla, Kla9 and K2a2al?> (Figs 1-4; Supplementary Fig. $4). 
These three founder clusters show a strong expansion signal 
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Figure 1 | Inferred ancestry of the main subclades within haplogroup U8. The timescale (ka) is based on ML estimations for mitogenomes. Inset: 
Bayesian skyline plot of 34 Ashkenazi haplogroup K lineages, showing growth in effective population size (Nep) over time. 
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Figure 2 | Phylogenetic tree of haplogroup K1a1b1. Time scale (ka) based on ML estimations for mitogenome sequences. 


beginning ~2.3 ka, with the overall effective population size for 
these lineages increasing 13-fold by 275 years ago (Fig.1). 
Klalbla (slightly re-defined, due to the improved resolution of 
the new tree) (Fig. 2) accounts for 63% of Ashkenazi K lineages 
(or ~20% of total Ashkenazi lineages) and dates to ~ 4.4 ka with 
maximum likelihood (ML); however, all of the samples within it, 
except for one, nest within a further subclade, Klalblal, dating 
to ~2.3ka (Supplementary Data 2). Klalb1al is also present in 
non-Ashkenazi samples, mostly from central/east Europe. As they 
are nested by Ashkenazi lineages, these are likely due to gene flow 


from Ashkenazi communities into the wider population. The 
pattern of gene flow out into the neighbouring communities is 
seen in the other two major K founders, and also in haplogroups 
H and J; it is especially clear when the nesting and nested 
populations are more distinct, for example in the case of 
haplogroup HV1b, which has a deep ancestry in the Near East 
(Fig. 5; Supplementary Table S4). 

The K1la1b1 lineages within which the Klalbla sequences nest 
(including 19 lineages of known ancestry) are solely European, 
pointing to an ancient European ancestry. The closest nesting 
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Figure 3 | Phylogenetic tree of haplogroup K1a9 in the context of the putative clade K1a9'10'15'26’30. Time scale (ka) based on ML estimations for 
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Figure 4 | Phylogenetic tree of haplogroup K2a2. Time scale (ka) based 
on ML estimations for mitogenome sequences. 


lineages are from Italy, Germany and the British Isles, with other 
subclades of Klalb1 including lineages from west and Mediter- 
ranean Europe and one Hutterite (Hutterites trace their ancestry 
to sixteenth-century Tyrol)?°. Typing/HVS-I results have also 
indicated several from Northwest Africa, matching European 
HVS-I types”, likely the result of gene flow from Mediter- 
ranean Europe. Klalbla is also present at low frequencies in 


Spanish-exile Sephardic Jews, but absent from non-European 
Jews, including a database of 289 North African Jews*”>. Notably, 
it is not seen in Libyan Jews*°, who are known to have a distinct 
Near Eastern ancestry, with no known influx from Spanish-exile 
immigrants (although Djerban Jews, with a similar history, have 
not been tested to date for mtDNA, they closely resemble Libyan 
Jews in autosomal analyses”’). Thus the Ashkenazi subclade of 
Klalb1 most likely had a west European source. 

Kla9 (Fig. 3; Supplementary Fig. $4), accounting for another 
20% of Ashkenazi K lineages (or 6% of total Ashkenazi lineages) 
and also dating to ~2.3ka with ML (Supplementary Data 2) 
again includes both Ashkenazi and non-Ashkenazi lineages solely 
from east Europeans (again suggesting gene flow out into the 
wider communities). Like Klalbla, it is also found, at much 
lower frequencies, in Sephardim. Here the ancestral branching 
relationships are less clear (Supplementary Note 1 and 
Supplementary Fig. $4), but Kla9 is most plausibly nested within 
the putative clade K1a9‘10'15/26'30, dating to ~9.8ka, which 
otherwise includes solely west European (and one Tunisian) 
lineages, again pointing to a west European source. 

K2a2 (Fig. 4) accounts for another 16% of Ashkenazi 
K lineages (or ~5% of total Ashkenazi lineages) and dates to 
~8Aka (Supplementary Data 2). Ashkenazi lineages are once 
more found in a shallow subclade, K2a2a1, dating to ~1.5ka, 
that otherwise again includes only east Europeans, suggesting 
gene flow from the Ashkenazim. Conversely, the nesting clades, 
K2a2 and K2a2a, although poorly sampled, include only French 
and German lineages. K2a2a is not found in non-European 
Jews?°. 

Haplogroup K is rarer in the North Caucasus than in Europe or 
the Near East (<4% (ref. 23)) and the three Ashkenazi founder 
clades have not been found there (Supplementary Note 2). We 
tested all eight K lineages out of 208 samples from the North 
Caucasus, and all belonged to the Near Eastern subclades K1a3, 
K1la4 and Klal2. Haplogroup K is more common in Chuvashia, 
but those sampled belong to K1a4, Kla5 and pre-K2a8. 

The fourth major Ashkenazi founder mtDNA falls within 
haplogroup N1b (ref. 2). The distribution of N1b is much more 
focused on the Near East than that of haplogroup K (ref. 24), 
and the distinctive Ashkenazi N1b2 subclade has accordingly 
being assigned to a Levantine source. N1b2 has until now been 
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Figure 5 | Phylogenetic tree of haplogroup HV1b. Time scale (ka) based on ML estimations for mitogenome sequences. 


found exclusively in Ashkenazim, and although it dates to only 
~2.3ka, it diverged from other N1b lineages ~20ka (ref. 24) 
(Supplementary Table S5). N1b2 can be recognized in the HVS-I 
database by the variant 16176A, but Behar et al.” tested 14 Near 
Eastern samples (and some east Europeans) with this motif and 
identified it as a parallel mutation. Therefore, despite the long 
branch leading to N1b2, no Near Eastern samples are known to 
belong to it. 

In our unpublished database of 6991 HVS-I sequences, 
however, we identified two Italian samples with the 16176A 
marker, which we completely sequenced. We confirmed that they 
belong to N1b2 but diverge before the Ashkenazi lineages ~5 ka, 
nesting the Ashkenazi cluster (Fig. 6; Supplementary Table S5). 
This striking result suggests that the Italian lineages may be relicts 
of a dispersal from the Near East into Europe before 5 ka, and that 
N1b2 was assimilated into the ancestral Ashkenazi population on 
the north Mediterranean ~2ka. Although we found only two 
samples suggesting an Italian ancestry for N1b2, the control- 
region database available for inspection is very large (28,418 
HVS-I sequences from Europe, the Near East and the Caucasus, 
of which 278, or ~1%, were N1b). Moreover, the conclusion is 
supported by our previous founder analysis of Nlb HVS-I 
sequences, which dated the dispersal into Europe to the late 
Pleistocene/early Holocene**. 


Minor Ashkenazi mtDNA lineages. There is now a large 
number of mitogenomes from Europe, the Caucasus and the 
Near East (~3,500, with >70 Ashkenazim), and a substantial 
Ashkenazi mtDNA control-region database of 836 samples”! 
(Supplementary Table S6). We therefore endeavoured to cross- 
reference the two in order to pinpoint most of the control-region 
data within the mitogenome phylogeny. 

Besides the four haplogroup K and N1b founders, the major 
haplogroup in Ashkenazi Jews is haplogroup H, at 23% of 
Ashkenazi lineages, which is also the major haplogroup in 


Europeans (40-50% in Europe, ~ 25% in the North Caucasus and 
~19% in the Near East)?®. There are 29 Ashkenazi H 
mitogenomes available (Supplementary Table S7), 26 (90%) of 
which nest comfortably within European subclades dating to the 
early Holocene (Supplementary Note 3, Figs 7 and 8; 
Supplementary Figs S5-S10; Supplementary Table S8). Most, in 
fact, nest more specifically within west/central European 
subclades, with closely matching sequences in east Europe, as 
with the pattern for the K founder clades. The Ashkenazi 
mitogenomes from haplogroup H include 39% belonging to H1 
or H3, which are most frequent in west Europe and rare outside 
Europe. The nesting relationships in some cases point (albeit 
tentatively) to a central European source, but in many cases 
comparison with the HVS-I database indicates matches in west 
Europe. The phylogeographic conclusions based on the nesting 
relationships are strongly supported for haplogroup H by 
evidence from the study of prehistoric remains, showing in 
almost all cases that the lineages concerned were present in 
Europe since at least the early Bronze Age, ~3.5ka 
(Supplementary Table $7)?*. There is no suggestion of 
assimilation from the North Caucasus, where most H lineages 
differ from those of Europe”? (Supplementary Note 2). 

Haplogroup J comprises 7% of the Ashkenazi control-region 
database. Around 72% of these can be assigned to Jlc, now 
thought to have arisen within Late Glacial Europe*°, and 19% 
belong to J1blal, also restricted to Europe. Thus >90% of the 
Ashkenazi J lineages have a European origin, with ~ 7% (J1b and 
J2b) less clearly associated. Many have a probable west/central 
European source, despite (like H) being most frequent in eastern 
Ashkenazim. The four Ashkenazi J mitogenomes, in J1c5, Jlc7ala 
and Jlc7d, once again show a striking pattern of Mediterranean, 
west and central European lineages enclosing Ashkenazi/east 
European ones (Fig. 9). 

Haplogroups U5, U4 and HVO (6.3% between them overall) 
arose within Europe. Some of these lineages, which are again 
more frequent in the eastern than western Ashkenazi, may have 
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Figure 6 | Phylogenetic tree of haplogroup N1b. Time scale (ka) based on ML estimations for mitogenome sequences. 


been assimilated in central Europe. The haplogroup T lineages 
(5% overall) are more difficult to assign, but at least 60% (in 
T2alb, T2b, T2el and T2e4) are likely of European and ~ 10% 
(T1b3 and T2a2) Near Eastern origin. The haplogroup I 
lineages have evidently been present in Europe at least since the 
Neolithic, as indicated by both phylogeographic and ancient 
DNA analyses*!. Haplogroup W3 may have originated in the 
Near East but spread to Europe as early as the Late Glacial?!. The 
Mlalb lineage is characteristic of the north Mediterranean and 
was most likely assimilated there**, but the U6a and L2all 
lineages are more difficult to pin down. 

The main lineages with a potentially Near Eastern source 
include HV1, ROala and U7a5 (~8.3% in all). HV1b2 mito- 
genomes, in particular, date to ~2ka and nest within a cluster 
of Near Eastern HV1b lineages dating to ~18ka (Fig. 5; 
Supplementary Table S4). Others such as Ula and U1b have an 
ultimately Near Eastern origin but, like N1b, have been 
subsequently distributed around the north Mediterranean. In 
general, it is more difficult to assign lineages to a Near Eastern 
source with confidence, as the much larger control-region 
database indicates that (as with N1b2) many lineages with deep 
Near Eastern ancestry became widely dispersed along the north 
Mediterranean during the Holocene, and may alternatively have 
been assimilated there. 

If we allow for the possibility that K1a9 and N1b2 might have a 
Near Eastern source, then we can estimate the overall fraction of 
European maternal ancestry at ~ 65%. Given the strength of the 
case for even these founders having a European source, however, 
our best estimate is to assign ~81% of Ashkenazi lineages to a 
European source, ~8% to the Near East and ~1% further to 
the east in Asia, with ~10% remaining ambiguous (Fig. 10; 


Supplementary Table S9). Thus at least two-thirds and most likely 
more than four-fifths of Ashkenazi maternal lineages have a 
European ancestry. 


Discussion 

The extent to which Ashkenazi Jewry trace their ancestry to the 
Levant or to Europe is a long-standing question, which remains 
highly controversial>+°!2-!416!7, Our results, primarily from the 
detailed analysis of the four major haplogroup K and N1b 
founders, but corroborated with the remaining Ashkenazi 
mtDNAs, suggest that most Ashkenazi maternal lineages trace 
their ancestry to prehistoric Europe. 

Previous researchers proposed a Levantine origin for the three 
Ashkenazi K founders from several indirect lines of evidence: 
shared ancestry with non-Ashkenazi Jews, shared recent ancestry 
with Mediterranean samples, and their absence from amongst 
non-Jews’, and this suggestion has been widely accepted’. 
However, our much more detailed analyses show that two of 
the major Ashkenazi haplogroup K lineages, Klalbla and K2a2al 
have a deep European ancestry, tracing back at least as far as the 
early and mid-Holocene respectively. They both belong to ancient 
European clades (Klalbl and K2) that include primarily 
European mtDNAs, to the virtual exclusion of any from the 
Near East. Despite some uncertainty in its ancestral branching 
relationships, a European ancestry seems likely for the third 
founder clade, K1a9. The heavy concentration of Near Eastern 
haplogroup K lineages within particular, distinct subclades of the 
tree, and indeed the lack of haplogroup K lineages in Samaritans, 
who might be expected to have shared an ancestral gene pool with 
ancient Israelites, both strongly imply that we are unlikely to have 
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Figure 7 | Schematic phylogenetic tree of haplogroup H1. Only the Ashkenazi lineages are shown in full detail; the distribution of other lineages is 
indicated using small squares by the number present in the full tree for each subclade. Prehistoric European (all Neolithic, except for the Hlaw lineage, 


which dates to the Iron Age) lineages are shown using red circles2?. 
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Figure 10 | Estimated contributions of European mtDNA lineages 

to the Ashkenazi mtDNA pool shown by major haplogroup. The possible 
overall Near Eastern contribution and fraction of unassigned lineages 

are also indicated. 


missed a hitherto undetected Levantine ‘reservoir of haplogroup 
K variation (Supplementary Note 1). 

Furthermore, our results suggest that N1b2, for which a Near 
Eastern ancestry was proposed (with much greater confidence 
than for K) by Behar et al?, is more likely to have been 
assimilated into the ancestors of the Ashkenazi in the north 
Mediterranean. Finally, our cross-comparison of control-region 
and mitogenome databases shows that the great majority 
of the remaining ~60% of Ashkenazi lineages, belonging to 
haplogroups H, J, T, HV0, U4/U5, I, W and M1 also have a 
predominantly European ancestry. 

Overall, it seems that at least 80% of Ashkenazi maternal 
ancestry is due to the assimilation of mtDNAs indigenous to 


Europe, most likely through conversion. The phylogenetic 
nesting patterns suggest that the most frequent of the Ashkenazi 
mtDNA lineages were assimilated in Western Europe, ~2 ka or 
slightly earlier. Some in particular, including N1b2, Mlalb, K1a9 
and perhaps even the major Klalbl, point to a north 
Mediterranean source. It seems likely that the major founders 
were the result of the earliest and presumably most profound 
wave of founder effects, from the Mediterranean northwards 
into central Europe, and that most of the minor founders 
were assimilated in west/central Europe within the last 1,500 
years. The sharing of rarer lineages with Eastern European 
populations may indicate further assimilation in some cases, but 
can often be explained by exchange via intermarriage in the 
reverse direction. 

The Ashkenazim therefore resemble Jewish communities in 
Eastern Africa and India, and possibly also others across the Near 
East, Caucasus and Central Asia, which also carry a substantial 
fraction of maternal lineages from their ‘host’ communities! !?5, 
Despite widely differing interpretations of autosomal data, these 
results in fact fit well with genome-wide studies, which imply a 
significant European component, with particularly close 
relationships to Italians>+%7. As might be expected from the 
autosomal picture, Y-chromosome studies generally show the 
opposite trend to mtDNA (with a predominantly Near Eastern 
source) with the exception of the large fraction of European 
ancestry seen in Ashkenazi Levites??. 

Evidence for haplotype sharing with non-Ashkenazi Jews for 
each of the three main haplogroup K founders may imply a 
partial common ancestry in Mediterranean Europe for Ashkenazi 
and Spanish-exile Sephardic Jews, but may also, at least in part, be 
due to subsequent gene flow, especially into Bulgaria and Turkey, 
both of which witnessed substantial immigration from Ashkenazi 
communities in the fourteenth and fifteenth centuries. Gene flow 
could have been substantial in some cases—ongoing intermar- 
riage is likely when these communities began living in closer 
proximity after the Spanish exile. A partial common ancestry for 
all European Jews—both Ashkenazi and Sephardic—is again 
strongly supported by the autosomal results**, 
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Jewish communities were already spread across the Graeco- 
Roman and Persian world > 2,000 years ago. It is thought that a 
substantial Jewish community was present in Rome from at least 
the mid-second century BCE, maintaining links to Jerusalem 
and numbering 30,000-50,000 by the first half of the first century 
CE!5. By the end of the first millennium CE, Ashkenazi 
communities were historically visible along the Rhine valley in 
Germany**. After the wave of expulsions in Western Europe 
during the fifteenth century, they began to disperse once more, 
into Eastern Europe?’ 

These analyses suggest that the first major wave of assimilation 
probably took place in Mediterranean Europe, most likely in the 
Italian peninsula ~2ka, with substantial further assimilation of 
minor founders in west/central Europe. There is less evidence for 
assimilation in Eastern Europe, and almost none for a source in 
the North Caucasus/Chuvashia, as would be predicted by the 
Khazar hypothesis®?—rather, the results show strong genetic 
continuities between west and east European Ashkenazi 
communities!®, albeit with gradual clines of frequency of 
founders between east and west!” (Supplementary Note 2). 

There is surprisingly little evidence for any significant founder 
event from the Near East. Fewer than 10% of the Ashkenazi 
mtDNAs can be assigned to a Near Eastern source with any 
confidence, and these are found at very low frequencies (Fig. 2). 
The most frequent, belonging to HV1b2, ROala and U7, are 
found at only ~3, 2 and 1% respectively. All are widespread 
across Ashkenazi communities, and might conceivably be relicts 
of early Levantine founders, but it seems likely that other more 
minor Near Eastern lineages are the result of more recent gene 
flow into the Ashkenazim. 

The age estimates for the European founders might suggest 
(very tentatively, given the imprecision with present data) that 
these ancestral Jewish populations harboring haplogroup K and 
especially N1b2 may have had an origin in the first millennium 
BCE, rather than in the wake of the destruction of the Jerusalem 
Temple in 70 CE. In fact, some scholars have argued from 
historical evidence that the large-scale expansion of Judaism 
throughout the Mediterranean in the Hellenistic period was 
primarily the result of proselytism and mass-conversion, 
especially amongst women’. We anticipate that a combination 
of large-scale mitogenome and whole Y-chromosome analysis, 
complementing full human genome sequencing, will be able to 
address this question in much finer detail in the near future. 

Despite the potential of genomic studies, the particular value of 
full-mitogenome sequencing should be stressed, as some studies 
dismissed the value of uniparental markers because of the impact 
of drift in the Ashkenazim®. In fact, the reverse may be the case: 
autosomal studies may be confounded by drift whereas the fine 
genealogical resolution of full mitogenomes, given sufficient 
sampling, can provide a detailed reconstruction of the history of 
Ashkenazi women. The mtDNA genealogy may even be 
considered to have particular relevance due to the matrilineal 
inheritance found in Judaism since at least ~ 200 CE and possibly 
several centuries earlier, helping to ‘fix’ incoming lineages from 
converts within the Ashkenazi community after this time. With 
sufficient resolution, a detailed genealogical history for every 
maternal lineage in the Ashkenazim is now within reach; in fact, 
it should soon be possible to reconstruct the outlines of the entire 
dispersal history of each community. 


Methods 

Samples and analysis of mtDNA sequence variation. Although there is a 
growing database of whole mitogenomes, almost all those from haplogroup U8 are 
from Europeans or individuals of European (predominantly west European) 
ancestry. Yet evidence from the Near East is critical in drawing up a meaningful 
picture of European (and wider west Eurasian) demographic prehistory. We 


therefore selected 67 predominantly Near Eastern haplogroup K samples (identi- 
fied by full control-region sequencing of 111 haplogroup K samples) for mito- 
genome sequencing, plus five samples belonging to non-K U8 and two from Italy 
potentially belonging to N1b2 (Supplementary Data 1). We collected samples with 
the appropriate informed consent of the subjects and the work was approved by the 
University of Leeds, Faculty of Biological Sciences Ethics Committee, the Ethics 
Committee for Clinical Experimentation at the University of Pavia, and the 
Western Institution Review Board (WIRB), Olympia, WA, USA. We sequenced 
them using Sanger sequencing*?*4 and, to maximize the number of samples, we 
performed a phylogenetic analysis alongside 884 published U8 sequences (a total of 
909 belonging to haplogroup K) (Supplementary Data 1) and four haplogroup N 
outgroup sequences, using Network 4.6 software and the reduced-median algo- 
rithm>. We then constructed a putative most-parsimonious tree of the 956 U8 
sequences by hand from the network, following PhyloTree*© for known subclades. 
We used mtDNA-GeneSyn*’ to convert files. As there are a number of extremely 
variable sites in K1 (positions 195 and 16,093 in particular), we confirmed the 
overall topology by running networks of coding-region data only. We performed 
similar analyses for haplogroups H, J and T, and for N1b we augmented our 
previously published tree~*. 


Age estimates and phylogeographic distribution. We estimated coalescence 
times of clades, using the p statistic and ML?®39, with Bayesian estimations for 
mitogenomes using BEAST“. For the p statistic and ML, we corrected for 
purifying selection using the calculator we developed previously (Supplementary 
Data 4)°8. We defined some sub-haplogroups to be a priori monophyletic in the 
analysis (U8, U8a, U8b, K, K1, Kla, K1b, K1c, K2, K2a and K2b) and assumed a 
generation time of 25 years*!. We also obtained Bayesian skyline plots*?~“* to 
estimate ‘haplogroup-effective’ population sizes associated with U8 over time, and 
estimated the period of maximum growth*?. 

For a broader overview of the diversity and geographic distribution of lineages, 
we also compiled 1,917 haplogroup K HVS-I (hypervariable sequence I) sequences 
(in the range 16,051-16,400), 87 from U8a and 52 from U8b1 (from Europe, the 
Near East and North Africa, from a total database of 33,127 HVS-I sequences) 
(Supplementary Tables S1 and S2). We displayed frequency and diversity 
distributions of haplogroups K, U8al and U8b1 sequences, identified from their 
motifs in the HVS-I database, on interpolation maps using Surfer. For the 
frequency analyses, we analysed the data at the level of published regional 
populations; for the diversity analyses we aggregated them into broader areas, as 
described in Supplementary Table S2. For the analyses of other Ashkenazi lineages 
we compared 836 published control-region sequences!!! with available 
Ashkenazi whole mitogenomes and the global mitogenome database available on 
GenBank, in order to assign the Ashkenazi control-region lineages to subclades. 
For geographic distributions, we supplemented and checked this information 
against a database of control-region data (38,244 records from west Eurasia, 
Central Asia and North Africa). 
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